Probability Basics
Joint distribution
The joint distribution of two random variables
Marginal distribution
The Marginal distribution of
This is also called Sum rule / The rule of total probability.
Conditional joint
Product rule
Chain rule
Note that Markov chain can be regarded as a simplified version of the chain rule, i.e. predict by ignoring all previous events but the last one(s), so that
- application for multiple variables:
derivation:
$$
p(A|B, C) = p(A, B, C) / p(B, C)
$$
with
$$
p(A, B, C) = p(B|A, C) p(A|C) p(C), \ \ p(B, C) = p(B|C)p(C)